A Review of Pseudospectral Optimal Control: From Theory to Flight

Ross, I. M., Karpenko, M.

arXiv.org Artificial Intelligence

The home space for optimal control is a Sobolev space. The home space for pseudospectral theory is also a Sobolev space. It thus seems natural to combine pseudospectral theory with optimal control theory and construct "pseudospectral optimal control theory," a term coined by Ross. In this paper, we review key theoretical results in pseudospectral optimal control that have proven to be critical for a successful flight. Implementation details of flight demonstrations onboard NASA spacecraft are discussed along with emerging trends and techniques in both theory and practice. The 2011 launch of pseudospectral optimal control in embedded platforms is changing the way in which we see solutions to challenging control problems in aerospace and autonomous systems.
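To make the core mechanic concrete, here is a minimal, self-contained sketch (emphatically not the flight software): a differentiation matrix turns the dynamics constraint into algebraic constraints at collocation nodes, so a toy minimum-energy transfer becomes a small nonlinear program. Chebyshev nodes are used for brevity, whereas pseudospectral optimal control typically uses Legendre points paired with their quadrature weights; the trapezoid rule below is purely for illustration.

```python
# Minimal pseudospectral collocation sketch (illustration only).
import numpy as np
from scipy.optimize import minimize

def cheb(N):
    """Chebyshev-Gauss-Lobatto nodes on [-1, 1] and the differentiation
    matrix D such that D @ f(x) approximates f'(x) at the nodes."""
    x = np.cos(np.pi * np.arange(N + 1) / N)
    c = np.hstack([2.0, np.ones(N - 1), 2.0]) * (-1.0) ** np.arange(N + 1)
    dX = x[:, None] - x[None, :]
    D = np.outer(c, 1.0 / c) / (dX + np.eye(N + 1))
    return D - np.diag(D.sum(axis=1)), x

# Toy problem: minimize control energy of dx/dt = u on t in [0, 1]
# with x(0) = 0 and x(1) = 1 (exact optimum: u = 1, cost = 1).
D, tau = cheb(16)                 # tau runs from +1 down to -1
n = tau.size
w = np.abs(np.diff(tau))          # crude trapezoid widths, illustration only

def cost(z):                      # z = [states; controls]
    u = z[n:]
    return np.sum(w * (u[1:] ** 2 + u[:-1] ** 2) / 2.0) / 2.0  # dt = dtau/2

constraints = [
    {"type": "eq", "fun": lambda z: D @ z[:n] - 0.5 * z[n:]},  # dynamics
    {"type": "eq", "fun": lambda z: [z[n - 1], z[0] - 1.0]},   # x(0)=0, x(1)=1
]
res = minimize(cost, np.zeros(2 * n), constraints=constraints)
print(res.fun)  # ~1.0
```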



SemML: Enhancing Automata-Theoretic LTL Synthesis with Machine Learning

Kretinsky, Jan, Meggendorfer, Tobias, Prokop, Maximilian, Zarkhah, Ashkan

arXiv.org Artificial Intelligence

Synthesizing a reactive system from specifications given in linear temporal logic (LTL) is a classical problem, finding its applications in safety-critical systems design. We present our tool SemML, which won this year's LTL realizability tracks of SYNTCOMP, after years of domination by Strix. While both tools are based on the automata-theoretic approach, ours relies heavily on (i) semantic labelling, additional information of a logical nature coming from recent LTL-to-automata translations and decorating the resulting parity game, and (ii) machine-learning approaches turning this information into a guidance oracle for on-the-fly exploration of the parity game (whence the name SemML). Our tool fills the gaps left by previous suggestions to use such an oracle and provides an efficient implementation with additional algorithmic improvements. We evaluate SemML both on the entire SYNTCOMP benchmark set and on a synthetic data set, compare it to Strix, and analyze the advantages and limitations. As SemML solves more instances on SYNTCOMP and does so significantly faster on larger instances, this demonstrates for the first time that machine-learning-aided approaches can outperform state-of-the-art tools in real LTL synthesis.
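As an illustration of item (ii), a learned oracle steering on-the-fly exploration, here is a generic best-first loop; the names (`oracle`, `successors`, `is_target`) are hypothetical placeholders rather than SemML's API, and the actual tool scores parity-game vertices using features derived from the semantic labelling.

```python
# Generic oracle-guided, on-the-fly exploration (a sketch, not SemML's API).
import heapq
from typing import Callable, Dict, Hashable, Iterable, Optional

State = Hashable

def guided_explore(initial: State,
                   successors: Callable[[State], Iterable[State]],
                   oracle: Callable[[State], float],  # learned score: higher = more promising
                   is_target: Callable[[State], bool],
                   budget: int = 100_000) -> Optional[State]:
    """Expand the states the oracle deems most promising first, so a target
    vertex can often be found long before the full game graph is built."""
    tie = 0                                  # tiebreaker: states may not be comparable
    frontier = [(-oracle(initial), tie, initial)]
    seen: Dict[State, bool] = {initial: True}
    while frontier and budget:
        _, _, state = heapq.heappop(frontier)
        budget -= 1
        if is_target(state):
            return state
        for nxt in successors(state):
            if nxt not in seen:
                seen[nxt] = True
                tie += 1
                heapq.heappush(frontier, (-oracle(nxt), tie, nxt))
    return None
```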


Scaling Down Semantic Leakage: Investigating Associative Bias in Smaller Language Models

Smilga, Veronika

arXiv.org Artificial Intelligence

Semantic leakage is a phenomenon recently introduced by Gonen et al. (2024). It refers to a situation in which associations learnt from the training data emerge in language model generations in an unexpected and sometimes undesired way. Prior work has focused on leakage in large language models (7B+ parameters). In this study, I use the Qwen2.5 model family to explore whether smaller models, ranging from 500M to 7B parameters, demonstrate less semantic leakage due to their limited capacity for capturing complex associations. Building on the dataset from Gonen et al. (2024), I introduce a new dataset of color-focused prompts, categorized into specific types of semantic associations, to systematically evaluate the models' performance. Results indicate that smaller models exhibit less semantic leakage overall, although this trend is not strictly linear, with medium-sized models sometimes surpassing larger ones in leakage behavior. The dataset, the model generations, and the evaluation code are publicly available at https://github.com/smilni/semantic_leakage_project.
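A minimal sketch of this kind of measurement, assuming the Hugging Face checkpoints for the Qwen2.5 family; the two probes below are illustrative stand-ins for the released color-focused dataset, and the paper's actual metric additionally compares against control prompts that lack the associative cue.

```python
# Sketch of a leak-rate probe (illustrative prompts, not the released dataset).
from transformers import pipeline

MODELS = ["Qwen/Qwen2.5-0.5B", "Qwen/Qwen2.5-1.5B", "Qwen/Qwen2.5-7B"]

PROBES = [  # (prompt carrying a color association, color expected to leak)
    ("He loves strawberries. The color of his new car is", "red"),
    ("She spends every weekend gardening. Her favorite color is", "green"),
]

def leak_rate(model_name: str, n_samples: int = 10) -> float:
    """Fraction of sampled continuations that contain the associated color."""
    gen = pipeline("text-generation", model=model_name)
    hits = total = 0
    for prompt, color in PROBES:
        outs = gen([prompt] * n_samples, max_new_tokens=15, do_sample=True,
                   temperature=1.0, return_full_text=False)
        for out in outs:
            hits += color in out[0]["generated_text"].lower()
            total += 1
    return hits / total

for name in MODELS:
    print(name, leak_rate(name))
```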


Toxicity of the Commons: Curating Open-Source Pre-Training Data

Arnett, Catherine, Jones, Eliot, Yamshchikov, Ivan P., Langlais, Pierre-Carl

arXiv.org Artificial Intelligence

Open-source large language models are becoming increasingly available and popular among researchers and practitioners. While significant progress has been made on open-weight models, open training data is a practice the leading open-weight model creators have yet to adopt. At the same time, researchers are working to make language models safer. We propose a data curation pipeline to reduce harmful outputs by models trained on public domain data. There are unique challenges to working with public domain data, as these sources differ from web text in both form and content. Many sources are historical documents and are the result of Optical Character Recognition (OCR). Consequently, current state-of-the-art approaches to toxicity filtering are often infeasible or inappropriate for open data models. In this paper, we introduce a new fully open-source pipeline for open-data toxicity filtering. Our contributions are threefold. First, we create a custom training dataset, ToxicCommons, composed of texts classified across five dimensions (racial/origin-based, gender/sex-based, religious, and ability-based discrimination, and violence). Second, we use this dataset to train a custom classifier, Celadon, that can detect toxic content in open data more efficiently and at a larger scale. Finally, we describe a balanced approach to content filtering that optimizes safety filtering with respect to the amount of filtered data that remains available for training.
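A sketch of what dimension-wise filtering can look like, assuming a generic Hugging Face text-classification head; `your-org/toxicity-classifier` is a placeholder (Celadon's actual interface may differ), and the threshold is illustrative rather than a calibrated operating point.

```python
# Dimension-wise toxicity filtering (sketch; model name and threshold are placeholders).
from transformers import pipeline

DIMENSIONS = ["racial/origin-based", "gender/sex-based", "religious",
              "ability-based", "violence"]   # the five axes from the abstract
THRESHOLD = 0.5                              # illustrative, not calibrated

classifier = pipeline("text-classification",
                      model="your-org/toxicity-classifier",  # placeholder checkpoint
                      top_k=None)            # return a score for every label

def keep(document: str) -> bool:
    """Keep a document only if it scores below threshold on every dimension."""
    scores = {d["label"]: d["score"] for d in classifier([document])[0]}
    return all(scores.get(dim, 0.0) < THRESHOLD for dim in DIMENSIONS)

corpus = ["An OCR'd public-domain pamphlet ...", "Another historical text ..."]
filtered = [doc for doc in corpus if keep(doc)]
```

The trade-off the abstract calls "balanced" lives in the threshold: lower values filter more aggressively but shrink the corpus that remains for training.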


The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare

Pashangpour, Souren, Nejat, Goldie

arXiv.org Artificial Intelligence

The potential use of large language models (LLMs) in healthcare robotics can help address the significant demand placed on healthcare systems around the world by an aging demographic and a shortage of healthcare professionals. Even though LLMs have already been integrated into medicine to assist both clinicians and patients, the integration of LLMs within healthcare robots has not yet been explored for clinical settings. In this perspective paper, we investigate the groundbreaking developments in robotics and LLMs to identify the system requirements for designing health-specific, LLM-based robots in terms of multimodal communication through human-robot interactions (HRIs), semantic reasoning, and task planning. Furthermore, we discuss the ethical issues, open challenges, and potential future research directions for this emerging and innovative field.
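As a purely illustrative sketch of the task-planning piece (not a system from the paper), one common pattern is to have an LLM emit a machine-checkable plan over a fixed skill set; `call_llm` below is a hypothetical stand-in for any chat-model API, and the skills are invented for the example.

```python
# Hypothetical LLM-based task planner for a care robot (illustration only).
import json

SKILLS = ["navigate_to", "pick_up", "hand_over", "speak"]

PROMPT = """You are a hospital assistance robot. Available skills: {skills}.
Return a JSON list of steps, each {{"skill": ..., "arg": ...}}, for the task:
{task}"""

def plan(task: str, call_llm) -> list:
    raw = call_llm(PROMPT.format(skills=", ".join(SKILLS), task=task))
    steps = json.loads(raw)
    # validate before execution: never dispatch a skill the robot lacks
    for step in steps:
        if step["skill"] not in SKILLS:
            raise ValueError(f"unknown skill: {step['skill']}")
    return steps
```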


A Near-Linear Time Approximation Algorithm for Beyond-Worst-Case Graph Clustering

Cohen-Addad, Vincent, d'Orsi, Tommaso, Mousavifar, Aida

arXiv.org Artificial Intelligence

We consider the semi-random graph model of [Makarychev, Makarychev and Vijayaraghavan, STOC'12], where, given a random bipartite graph with $\alpha$ edges and an unknown bipartition $(A, B)$ of the vertex set, an adversary can add arbitrary edges inside each community and remove arbitrary edges from the cut $(A, B)$ (i.e., all adversarial changes are \textit{monotone} with respect to the bipartition). For this model, a polynomial-time algorithm is known to approximate the Balanced Cut problem up to value $O(\alpha)$ [MMV'12] as long as the cut $(A, B)$ has size $\Omega(\alpha)$. However, it consists of slow subroutines that require optimal solutions to logarithmically many semidefinite programs. We study the fine-grained complexity of the problem and present the first near-linear time algorithm that achieves performance similar to that of [MMV'12]. Our algorithm runs in time $O(|V(G)|^{1+o(1)} + |E(G)|^{1+o(1)})$ and finds a balanced cut of value $O(\alpha)$. Our approach appears easily extendable to related problems, such as Sparsest Cut, and also yields a near-linear time $O(1)$-approximation to Dasgupta's objective function for hierarchical clustering [Dasgupta, STOC'16] for the semi-random hierarchical stochastic block model inputs of [Cohen-Addad, Kanade, Mallmann-Trenn, Mathieu, JACM'19].
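To make the input model concrete, here is a small generator for semi-random instances as described above; the function names are illustrative, and the example adversary performs one arbitrary but monotone edit (densifying inside $A$, thinning the cut).

```python
# Generator for the semi-random model sketched in the abstract (illustrative).
import random

def semi_random_instance(n: int, alpha: int, adversary=None, seed: int = 0):
    """Planted bipartition (A, B) with up to alpha random cut edges
    (duplicates collapse), then arbitrary monotone adversarial changes."""
    rng = random.Random(seed)
    nodes = list(range(n))
    rng.shuffle(nodes)
    A_list, B_list = nodes[: n // 2], nodes[n // 2:]
    edges = {(rng.choice(A_list), rng.choice(B_list)) for _ in range(alpha)}
    if adversary is not None:
        edges = adversary(edges, set(A_list), set(B_list))
    return edges, (set(A_list), set(B_list))

def example_adversary(edges, A, B):
    # allowed moves only: add edges inside a community, remove cut edges
    a = sorted(A)[:6]
    inside = {(u, v) for i, u in enumerate(a) for v in a[i + 1:]}
    kept_cut = {e for i, e in enumerate(sorted(edges)) if i % 2 == 0}
    return kept_cut | inside

G, (A, B) = semi_random_instance(n=100, alpha=50, adversary=example_adversary)
```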


From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency

Ohmer, Xenia, Bruni, Elia, Hupkes, Dieuwke

arXiv.org Artificial Intelligence

The staggering pace with which the capabilities of large language models (LLMs) are increasing, as measured by a range of commonly used natural language understanding (NLU) benchmarks, raises many questions regarding what "understanding" means for a language model and how it compares to human understanding. This is especially true since many LLMs are exclusively trained on text, casting doubt on whether their stellar benchmark performances are reflective of a true understanding of the problems represented by these benchmarks, or whether LLMs simply excel at uttering textual forms that correlate with what someone who understands the problem would say. In this philosophically inspired work, we aim to create some separation between form and meaning, with a series of tests that leverage the idea that world understanding should be consistent across presentational modes - inspired by Fregean senses - of the same meaning. Specifically, we focus on consistency across languages as well as paraphrases. Taking GPT-3.5 as our object of study, we evaluate multisense consistency across five different languages and various tasks. We start the evaluation in a controlled setting, asking the model for simple facts, and then proceed with an evaluation on four popular NLU benchmarks. We find that the model's multisense consistency is lacking and run several follow-up analyses to verify that this lack of consistency is due to a sense-dependent task understanding. We conclude that, in this aspect, the understanding of LLMs is still quite far from being consistent and human-like, and deliberate on how this impacts their utility in the context of learning about human language and understanding.
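A minimal sketch of the consistency measurement, with a hypothetical `ask()` wrapper standing in for whatever API serves the model (the paper studies GPT-3.5); the toy question below happens to have a language-invariant one-word answer, whereas the real evaluation needs task-specific answer matching across senses.

```python
# Multisense consistency sketch: same question, several "senses"
# (translations/paraphrases), agreement measured over answer pairs.
def multisense_consistency(ask, senses: list[str]) -> float:
    """Fraction of sense pairs that receive the same normalized answer."""
    answers = [ask(s).strip().lower() for s in senses]
    pairs = [(a, b) for i, a in enumerate(answers) for b in answers[i + 1:]]
    agree = sum(a == b for a, b in pairs)
    return agree / len(pairs) if pairs else 1.0

senses = [
    "What is the capital of France? Answer with one word.",
    "Quelle est la capitale de la France ? Répondez en un mot.",      # French
    "Wie heißt die Hauptstadt Frankreichs? Antworte mit einem Wort.",  # German
]
# score = multisense_consistency(my_gpt35_call, senses)  # hypothetical wrapper
```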


Unique therapy helps some young people with autism interact better with others

FOX News

A New York speech pathologist is using improvisational theater, better known as "improv," to help young adults with autism spectrum disorder (ASD) develop their social skills. Bob Domingo, PhD, a speech-language pathologist and assistant professor at Long Island University Post in Brookville, New York, is combining his skills and love of improv to help those with ASD. "Through improv, I am able to combine my knowledge of speech, language and communication with improv games and activities, to open up new, fun ways to communicate with others in developing spontaneous, unscripted 'scenes' or conversations," Domingo told Fox News Digital in an interview. For individuals with ASD, symptoms can vary in severity.


RFK Jr. speaks candidly about his gravelly voice

Los Angeles Times

There was a time before the turn of the millennium when Robert F. Kennedy Jr. gave a full-throated accounting of himself and the things he cared about. He recalls his voice then as "unusually strong," so much so that he could fill large auditoriums with his words. The independent presidential candidate recounts those times somewhat wistfully, telling interviewers that he "can't stand" the sound of his voice today -- sometimes choked, halting and slightly tremulous. He has spasmodic dysphonia, a rare neurological condition in which an abnormality in the brain's neural network results in involuntary spasms of the muscles that open or close the vocal cords. Still, Kennedy says, his voice "doesn't really get tired." "I feel sorry for the people who have to listen to me," Kennedy said in a phone interview with The Times, his voice sounding as strained as it does in his public appearances.